Search CORE

8 research outputs found

OVERLAPPED-SPEECH DETECTION WITH APPLICATIONS TO DRIVER ASSESSMENT FOR IN-VEHICLE ACTIVE SAFETY SYSTEMS

Author: Amardeep Sathyanarayana
John H L Hansen
Navid Shokouhi
Seyed Omid Sadjadi
Publication venue
Publication date: 03/04/2020
Field of study

ABSTRACT In this study we propose a system for overlapped-speech detection. Spectral harmonicity and envelope features are extracted to represent overlapped and single-speaker speech using Gaussian mixture models (GMM). The system is shown to effectively discriminate the single and overlapped speech classes. We further increase the discrimination by proposing a phoneme selection scheme to generate more reliable artificial overlapped data for model training. Evaluations on artificially generated co-channel data show that the novelty in feature selection and phoneme omission results in a relative improvement of 10% in the detection accuracy compared to baseline. As an example application, we evaluate the effectiveness of overlapped-speech detection for vehicular environments and its potential in assessing driver alertness. Results indicate a good correlation between driver performance and the amount and location of overlapped-speech segments

CiteSeerX

Consistent Estimation of Dimensionality for Data-Driven Methods in fMRI Analysis

Author: Abd-Krim Seghouane
Navid Shokouhi
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Two-Dimensional Whitening of Face Images for Improved PCA Performance

Author: Abd-Krim Seghouane
Navid Shokouhi
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Towards Developing a Distraction-Reduced Hands-Off Interactive Driving Experience using Portable Smart Devices

Author: Hansen John
Sathyanarayana Amardeep
Shokouhi Navid
Thomsen Nicolai
Zheng Yang
Publication venue: 'SAE International'
Publication date: 05/04/2016
Field of study

VBN

CRSS systems for 2012 NIST speaker recognition evaluation

Author: Gang Liu
Hynek Bořil
John H L Hansen
Navid Shokouhi
Omid Sadjadi
Seyed Taufiq Hasan
Publication venue
Publication date: 01/01/2013
Field of study

This paper describes the systems developed by the Center fo

CiteSeerX

The I4U Submission to the 2012 NIST Speaker Recognition Evaluation

Infoscience - École polytechnique fédérale de Lausanne

I4U Submission to NIST SRE 2012: a large-scale collaborative effort for noise-robust speaker verification

I4U is a joint entry of nine research Institutes and Universities across 4 continents to NIST SRE 2012. It started with a brief discussion during the Odyssey 2012 workshop in Singapore. An online discussion group was soon set up, providing a discussion platform for different issues surrounding NIST SRE’12. Noisy test segments, uneven multi-session training, variable enrollment duration, and the issue of open-set identification were actively discussed leading to various solutions integrated to the I4U submission. The joint submission and several of its 17 sub-systems were among top-performing systems. We summarize the lessons learnt from this large-scale effort

Infoscience - École polytechnique fédérale de Lausanne